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PREFACE 

GUIDE  TO  THE  PROGRAM  DOCUMENTATION 
This  volume  contains: 

CALLING  SEQUENCES  ' 

A brief  description  of  the  major  file  management  and  text 
buffering  routine  is  provided.  The  description  includes  the 
required  arguments  for  the  subroutines,  error  conditions,  and 
a list  of  subroutines  called  by  the  described  routine. 

FLOW  CHARTS 

A flow  chart  is  provided  for  each  of  the  user  commands  avail- 
able in  VEDIT  and  RECORD.  These  charts  assume  knowledge  of 
system  operation,  as  described  in  the  user  manual  (Volume  III) . 
Since  copious  reference  is  made  to  the  listings,  they  should 
also  be  consulted. 

LISTINGS 

The  program  listings  are  found  elsewhere.  A running  commen- 
tary is  provided  in  addition  to  a short  description  of  each 
program  module.  An  index  for  the  listings  is  found  in  Section  3 
of  this  volume.  The  first  part  of  the  index  gives  the  program 
names  followed  by  the  name  of  all  source  modules  required  to 
assemble  the  program.  The  second  part  gives  the  source  module 
name  followed  by  the  name  of  all  subroutines  in  that  module. 

LINKING  CONVENTION 

The  system  linking  programs  can  be  found  in  the  section  de- 
scribing the  module  STKBUF . The  conventions  used  are  described 
as  an  aid  to  understanding  the  attached  flow  charts. 

Subroutines  in  the  VRS  have  two  possible  returns.  The  first  is 
the  normal  return.  Execution  continues  after  the  call  as  nor- 
mally would  be  expected  from  a subroutine  return.  The  second 
is  called  an  "error"  return.  This  return  is  specified  by  pro- 
viding an  address  the  program  should  continue  at,  should  the 
error  return  be  taken. 

It  is  important  to  note  that  the  error  return  can  mean  one  of 
two  things:  an  actual  error  may  have  occurred  in  the  system, 

3uch  as  an  attempt  to  write  the  disk  with  the  write-lock  on; 
or,  an  error  return  can  result  from  a test  which  fails.  For 
example,  DCTBM  is  a routine  to  find  the  entry  in  the  dictionary- — 
which  best  matches  the  input  string.  If  no  match  occurs. 
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however,  the  error  return  is  taken.  This  does  not  indicate 
a program  error,  but  rather  is  used  to  indicate  that  the 
input  string  does  not  match  any  entry  in  the  dictionary.  In 
the  case  of  making  sure  that  a new  entry  does  not  already 
exist,  the  error  return  is  actually  the  desirable  return  for 
DCTBM. 

The  program  flow  charts  indicate  the  error  return  by  an  arrow 
leaving  the  subroutine  call,  which  is  labelled  "error". 

Again,  this  always  indicates  the  error  return  as  described 
above.  Of  course,  the  error  return  may  be  the  desired  return, 
thus,  the  reader  should  consult  the  listing  or  the  calling 
sequence  description  to  determine  theconditions  which  lead 
to  the  error  return. 
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1.  INTRODUCTION 
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This  documentation  describes  the  operation  of  the 
programs  comprising  the  single  channel  Voice  Response 
System  (VRS)  delivered  under  contract  DOT/TSC  1107. 
The  descriptions  include  a short  narrative  of  each 
module  and  corresponding  flow  charts.  The  program 
listings  are  furnished  under  separate  cover. 


1.1  HARDWARE  ENVIRONMENT 

The  single  channel  VRS  system  requires  a PDP-11/20 
computer  configured  as  shown  in  Figure  1.  At  least 
12K  core  memory  is  required. 


1.2  SOFTWARE  ENVIRONMENT 

The  programs  described  run  under  control  of  the  DEC 
RT-11  operating  system,  version  2.  The  single  job 
(SJ)  monitor  is  used.  There  are  three  main  programs 
callable  from  the  monitor  level  as  follows.  Each 
program  is  treated  in  detail  in  subsequent  sections. 


1.2.1  VEDIT 

Program  VEDIT  comprises  all  modules  required  to  create, 
modify  and  update  the  dictionary  which  maps  ASCII  names 
to  the  disk  resident  voice  files.  VEDIT  contains  a 
command  string  interpreter  (CSI)  for  reacting  to  user 
input.  A complete  description  of  VEDIT  and  other  program 
commands  is  given  in  the  user  manual. 


1.2.2  RECORD 

Program  RECORD  comprises  all  modules  required  to  enter 
audio  speech  utterances  into  the  system.  It  contains 
modules  which:  (1)  accept  audio  input  and  digitize  it 
into  a temporary  file;  (2)  process  the  file  by  the 
ADPCM  algorithm;  (3)  auto-edit  leading  and  trailing 
silence;  and  (4)  associate  each  utterance  with  a 
dictionary  entry  and  build  the  disk-based  voice  file. 
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1.2.3 


SPEAK 

Program  H516  comprises  all  modules  required  to 
generate  audio  from  the  voice  files.  The  modules 
accept  ASCII  text  from  the  H516  computer,  search 
for  the  disk  blocks  containing  the  voiced  text, 
and  decode  the  voice  file  into  the  audio  signal 
which  currently  drives  a speaker.  H516  contains 
the  routines  for  parsing  the  input  text,  including 
insertion  of  pauses,  identification  of  the  best 
text  match  using  phrase  look-ahead,  and  proper 
interpretation  of  numbers. 


1.2.4  System  Subroutines 

AH  programs  make  extensive  use  of  shared  subroutines 
which  perform  specific  tasks,  such  as  buffer  manage- 
ment, pattern  matching,  register  saving,  etc.  All 
subroutine  modules  of  general  interest  are  documented 
fully,  and  those  routines  unlikely  to  be  frequently 
required  have  a brief  description  of  the  routine  as 
well  as  its  calling  parameters.  All  routines  are 
identified  in  the  program  listings. 

1.2.5  Data  Base 

The  dictionary  and  voice  file  are  stored  in  a single 
disk  file:  " DIRECT. DVF" . The  generation  and  manage- 
ment of  the  data  base  is  described  below. 
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VEDIT 

VEDIT  is  entered  through  the  global  symbol  "CSI".  Upon 
entry  the  program  version  number  is  printed  and  the  system 
initialization  routine  is  called.  The  system  initializa- 
tion routine  opens  file  DIRECT. DVF  if  it  exists  or  creates 
a new  one  if  it  does  not.  If  an  error  occurs  in  system 
initialization,  the  program  exits,  since  the  program  is 
unable  to  proceed  without  proper  initialization.  If  no 
error  occurs,  the  main  loop  is  entered. 

Command  input  occurs  first  in  the  main  loop.  A question 
mark  is  printed  and  type-in  is  accepted  from  the  console 
terminal.  The  command  input  routine  does  not  return  until 
either  a new  character  is  entered  or  until  an  error  occurs. 
The  only  error  likely  to  occur  is  an  attempt  by  the  user 
to  input  a command  which  is  more  than  255  characters  long. 

The  command  entered  is  now  looked  up  in  a table  of  commands 
The  command  is  matched,  character  for  character,  with  com- 
mand names  in  the  table  until  the  first  break  character. 
Only  enough  of  the  command  name  need  be  typed  to  prevent 
matching  more  than  one  command.  When  the  command  is 
matched,  the  command  string  is  also  checked  for  any  switch 
options  to  the  command.  If  a switch  exists,  the  switch 
character  is  put  in  the  global  variable  "SWITCH".  Finally, 
a pointer  to  the  matching  routine  is  returned. 

The  routine  selected  by  command  lookup  is  executed.  The 
remainder  of  the  entered  command  is  passed  to  the  routine 
for  use  as  an  argument  list.  The  command  routine  also  has 
access  to  SWITCH  and  other  globals  in  the  program  to  insure 
its  proper  execution,  upon  completion  of  the  routine,  the 
remainder  of  the  command  string  not  used  is  ignored  and  the 
disk  copy  of  file  DIRECT. DVF  is  updated  if  any  changes  to 
temporarily  core  resident  sections  have  been  made. 

Errors  occurring  are  of  two  types.  The  first  can  be  called 
"recoverable".  These  are  such  things  as  user  typographical 
errors  or  undefined  or  incorrect  arguments.  In  this  case  a 
message  is  printed  and  a carriage  return  line  feed  is 
printed.  The  second  are  errors  such  as  a failure  of  the 
initialization  routine.  Errors  of  this  type  indicate  hard- 
ware or  an  RT-11  operating  system  error.  The  program  it- 
self is  incapable  of  handling  such  errors  and  so  the  pro- 
gram exits. 

The  following  section  contains  flow  charts  of  the  basic  com' 
mands  callable  from  the  VEDIT  command  string  interpreter. 


The  description  of  the  commands  described  in  the  VRS  users 
manual  provides  guidelines  for  following  the  flow  charts. 

In  addition,  there  ere  descriptions  of  the  major  subroutines 
* given  at  the  end  of  this  section.  Consulting  the  program 

listings  is  also  helpful. 
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The  record  module  is  structurally  identical  to  VEDIT. 

The  command  language  is  limited  to  the  following: 

LISTEN  (for  building  a temporary  file  of 

digitized  speech) 

SAVE  (for  encoding  and  editing  the 

digitized  speech,  and  cataloguing 
utterances  on  the  disk  according 
to  dictionary  entries) 

No  switches  or  arguments  are  required. 

The  flow  charts  for  these  commands  follows. 
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2.3  SPEAK 

The  H516  module  is  flow  charted  on  the  following 
page.  ASCII  text  is  input  from  the  H516  computer. 
The  lookup  algorithm  described  separately  parses 
the  text,  using  look-ahead  techniques,  and  returns 
a list  of  pointers  to  voice  files  which  are  fed  to 
the  playback  routine. 

The  play  back  routine  is  identical  to  the  TALK 
routine  which  is  used  by  VEDIT.  The  appropriate 
flow  chart  can  be  seen  there. 
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Phrase  Look-ahead  Algorithm 


This  section  is  devoted  to  an  analysis  of  the  "phrase 
look-ahead"  function  of  the  H516  program.  Several 
problem  areas  will  be  dealt  with.  A familiarity  with  a 
standard  binary  search  is  assumed. 

v Consider  the  dictionary  listed  in  Table  2-1  and  the  text 

string  "NEW  YORK  CITY  IS  A LARGE  PLACE".  There  are  two 
problems  that  must  be  dealt  with.  First,  "NEW"  matches 
two  entries  - "NEW"  if  taken  alone  or  "NEW  YORK  CITY"  if 
taken  as  part  of  a phrase.  "NEW  YORK  CITY"  is  the  better 
match,  but  if  the  binary  search  arrived  at  "NEW"  first, 
a match  would  be  detected  and  the  search  would  terminate 

j • prematurely. 

The  solution  lies  in  the  fact  that  a phrase  which  matches 
a string  will  always  be  alphabetically  later  in  the  dic- 

T tionary  than  a word  or  shorter  phrase  which  matches  the 

same  string.  Therefore,  whenever  a match  occurs  before 
the  end  of  the  search,  (before  Log2N  tries  for  a diction- 
ary of  N entries) , the  matching  entry  is  stored.  The 
match  is  then  treated  as  a mismatch  which  is  less  than 
• the  string  input.  At  the  end  of  the  search  the  most 

recently  encountered  match  will  be  the  best  match. 

% The  second  problem  is  somewhat  more  subtle.  Consider 

again  the  dictionary  in  Table  2-1,  and  the  input  string 
"NEW  YORK  STATE".  The  first  try  with  the  binary  search 
would  compare  the  input  string  with  "LASTING"  after  which 
it  would  try  "NEW  JERSEY".  The  character  where  the  mis- 
match occurs  is  at  "Y"  in  the  input  and  "J"  in  Jersey. 

This  would  indicate  that  the  input  is  greater  than  the 
entry  "NEW  JERSEY"  so  the  binary  search  would  proceed 
to  "NEWARK".  A blank  is  alphabetically  less  than  an 
"A",  so  the  binary  search  would  next  try  "NEW  YORK  CITY". 
This  would  be  its  last  try.  But  the  "S"  in  "STATE"  mis- 
matches the  "C"  in  "CITY".  Therefore,  the  binary  search 
would  indicate  no  match  even  though  "NEW"  by  itself  does 
match  an  entry. 

The  problem  occurred  at  the  entry,  "NEW  JERSEY".  The 
comparison  indicated  that  the  input  is  greater  than  the 
dictionary  entry  because  "Y"  is  greater  than  "J".  But 
since  this  mismatch  occurred  after  a blank  was  encountered, 
the  matching  entry  will  be  greater  than  the  input  only 
if  a phrase  with  at  least  one  blank  will  be  the  final 
match.  If  the  final  match  has  no  imbedded  blanks,  then 
the  matching  entry  will  always  be  less  than  the  entry 
where  the  mismatch  occurred  after  a blank  was  encountered. 
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The  binary  search  will  go  in  the  wrong  direction  after  a mis- 
match if  several  conditions  occur.  The  first  condition  is  that 
the  number  of  blanks  encountered  before  the  mismatch  occurs  is 
greater  than  the  number  of  blanks  that  will  occur  in  the  best 

: >•  possible  match.  The  second  is  that  the  result  of  the  mismatch 

% indicates  that  the  matching  entry  is  greater  than  the  entry 

just  compared  with  the  input  string,  as  in  comparing  "NEW  YORK" 

; i.1  with  "NEW  JERSEY". 

[ i 

I i 

| 

The  solution  involves  a "tree  search"  of  the  dictionary  whenever 
no  blanks  are  encountered.  When  a mismatch  is  encountered  the 
binary  search  proceeds  as  normal.  If,  however,  one  or  more 
blanks  are  encountered  in  the  mismatch  and  the  mismatch  directs 
the  search  to  proceed  in  the  "greater  than"  direction,  the 
point  in  the  search  where  this  occurred  is  pushed  onto  the  stack 
and  then  the  search  is  permitted  to  continue  in  the  "greater 
than"  direction.  At  the  end  of  the  search  done  in  that  way,  the 
location  in  the  search  which  was  pushed  on  the  stack  is  popped 
off  and  the  search  proceeds  again  from  that  point  but  in  the 
"less  than"  direction.  As  described  in  the  first  problem  area, 
the  searches  are  allowed  to  continue  even  if  matches  are  en- 
countered. The  best  match  will  be  the  match  containing  the 
most  imbedded  blanks. 

This  can  occur  any  number  of  times,  and  several  of  these  de- 
cision points  can  be  on  the  stack  at  once.  Also  note  that  if 
'•  one  is  proceeding  from  a point  at  which  the  mismatch  contained 

one  blank  and  is  moving  in  the  "greater  than”  direction,  the 
further  mismatches  must  contain  two  or  more  blanks  before  they 
can  be  saved  on  the  stack.  This  simplifies  the  tree  search 
somewhat  and  increases  the  speed  of  the  search. 

A flow  chart  of  the  algorithm  aids  in  understanding  this  opera- 
tion. It  is  helpful  to  draw  a dictionary  as  a binary  tree  and 
use  it  to  trace  the  search  for  various  inputs  to  find  the  path 
followed . 


TABLE  2-1.  SAMPLE  DICTIONARY 


DENSE 
DENSE  AIR 
DENSE  FOG 
DENSE  SMOKE 
LAST  CHANCE 
LAST  ENTRY 
LAST  FLIGHT 
LASTING 
M 
N 

NEW 

NEW  JERSEY 
NEW  YORK  CITY 
NEWARK 
YORK 
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?ath  followed  through  dictionary  to  find  best  match  for  input  string  of  "DENSE  TRAFFIC 


(DENSE) I (DENSE  FOG)  (LAST  CHANCE)  (LAST  FLIGHT)  (M)  (NEW)  (NEW  YORK  CITY)  (YORK) 


3 . SYSTEM  SUBROUTINE 


The  following  section  provides  an  index  to  the  system 
subroutines.  It  consists  of  an  index  to  assembly  modules 
used  by  each  program  and  a list  of  subroutines  in  the 
order  they  appear  in  the  module  listings.  Subroutines 
which  are  of  general  interest  are  provided  with  a working 
description.  The  combination  of  this  index  and  the  pro- 
gram listings  provide  the  documentation  needed  for  pro- 
gram maintenance. 
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3 . 1 PROGRAM  ASSEMBLY  MODULE  NAMES 


Program 


VEDIT 


Assembly  modules 


VEDCSI 

STKBUF 

COMBLK 


DIRPAG 

ERRORS 


INSERT 

DELETE 

GARBG 


Program 


RECORD 


Assembly  modules 


RECCSI 
STKBUF 
LISTEN 
AD  PCM 


Program 

Assembly  modules 


GLOBAL 


STKBUF 
LOK  516 
PLY516 


SPGLBL 


Other  modules : 


PARAMS 

Contains  assembly  parameters 


3 . 2 SUBROUTINE  NAMES 


Module 

Subroutines 


Module 

Subroutines 


VEDCSI 

SYSINT 

CSI 

GCOM 

CLKUP 

CSMTCH 

SWTCHK 

FLBLK 

BRKCH 

TTYOT 

CRLF 

OCTIN 

OCTOUT 

LISTNM 

CNFRM 

AYS 

FCE 

STKBUF 

PUSH 

RTRN 

ERTRN 

BUFINT 

RIN 

LINSET 

RINC 

RBKUP 

CROUT 

BUFRST 

BUFSET 

CRBKUP 

ROUT 


system  initialize 

main  program  loop 

get  command 

command  table  lookup 

command  string  match 

legal  switch  checking  routine 

flush  leading  break  characters 

check  for  break  character 

console  output 

print  carriage  return  line  feed 

octal  number  input 

octal  number  output 

list  a dictionary  name 

confirm  a request  for  an 
operation 

"Are  you  sure?"  command  verify 
fatal  consistency  error 


stack  push 

normal  subroutine  return 
subroutine  error  return 
ring  buffer  initialize 
ring  buffer  in 
ring  buffer  limit  set 
input  with  limit  set 
input  pointer  backup 
conditional  output 
reset  conditional  output 
output  set 
conditional  backup 
ring  buffer  output 
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Module 


COMBLK 


Subroutines 


Module : 


Module 

Subroutines 


KILL  - dictionary  reinitialize 

BYE  - program  exit 

Also  contains  command  table  used  for 
command  lookup . 

CTAB 

contains  all  tables  used  by  both  ADPCM 
encode  and  decode . 

DIRPAG 


VMINT 

RDBYT 

WRBYT 

RDWRD 

WRWRD 

VMNG 

VMBKUP 

DIRINT 

RDDE 

WRDE 

RDFSE 

WRFSE 

NFSPTR 


"Virtual  memory"  initialize 

word  and  byte  i/o  on  dictionary 
via  "virtual  memory" 

virtual  memory  manager 

back  up  virtual  memory  or  disk 

directory  initialize 

directory  entry  read  and  write 

free  storage  entry  read  and  write 

get  next  entry  with  same  file 
pointer 


CRFSE  - create  free  storage  entry 
FSPACK  - pack  contiguous  free  storage 
DELFSE  - delete  free  storage  entry 

GUID  - generate  unique  identifier  for 
file 

CRDIR  - create  a dictionary  entry 

CRDCT  - create  a text  name  for  a 
dictionary  entry 

DELDE  - delete  dictionary  entry 

DCTBM  - get  best  match  in  dictionary 

DLKUP  - binary  search  dictionary  lookup 

DMTCH  - match  single  dictionary  entry 

STRMTC  - as  above  best  for  wild  card 
option 
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Module 


DIRPAG  (Continued) 


Subroutines 


Module 

Subroutines 

Module 

Subroutines 


Module 

Subroutines 


FLMTCH  - match  entry  with  full  command 
string 

ARGINT  - initialize  NXTARG  routine 

NXTARG  - return  successive  entries  in 

dictionary  which  match  command 
string 

Also  program  global  variables. 


ERRORS 

PRERR  - print  error  message 

PRCT  - print  current  token  in  command 

Also  error  messages  (ASCII  TEXT) 


LIST 


LIST  - 
LSTCHK  - 

LSTHDR  - 
LSTPRT  - 
LSTFS  - 


program  list  command 

check  arg  to  see  if  it  should 
be  listed 

print  listing  header 

print  file  name 

list  file  area  free  storage 


TALK 


TALK  - 
SPINT  - 
PBFINT  - 
FILBUF  - 
ADSTRT  - 


program  talk  command 

speaking  buffer  initialize 

buffer  set  up 

maintain  speaking  buffers 

start  of  A/D  convertor  (used  as 
clock) 

interrupt  service  and  ADPCM 
decode 


PBINT 


Module 

Subroutines 

Module 

Subroutines 

Module 

Subroutines 

Module 

Subroutines 


Module 

Module 

Subroutines 


DELETE 

DELETE  - 

INSERT 

INSERT 

SYNON 

RENAME 

ENTER 

GARBG 

GARBG  - 

RECCSI 

CSI 
GCCM 
CLKUP  - 
CSMTCH  - 
FLBLK  - 
BRKCH  - 
TTYOT  - 
CRLF 

PRERR  - 
PRCT 
AYS 
FCE 

GLOBAL 

Contains 

LISTEN 

LISTEN  - 
PRGINT  - 
BFRSTP  - 
DSKRCD  - 
ADINT  - 
PRGFIN  - 
GAIN 


program  DELETE  command 


program  commands 


program  command  for  free 
storage  "garbage  collections" 


program  main  loop 
get  command 
command  lookup 
command  string  match 
flush  blanks 

check  for  break  characters 
console  output 

print  carriage  return  line  feed 

print  error  message 

print  current  token 

"Are  you  sure?" 

fatal  consistency  error 

all  program  global  variables 


program  LISTEN  command 

initialize 

buffer  set  up 

disk  recording  routine 

program  interrupt  handler 

program  close 

program  command  to  set 
GAIN  on  A/D 
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Module 


ADPCM 


Subroutines 


ADPST  - 

SAVE  - 

WDPACK  - 

DOPEN  - 

FRALOC  - 

WRINT 

RDINT 

WRCLS  - 

WRTWAT 

REDWAT 

AUTORD 

ADPCM  - 

WBINS  - 

FBSQ  - 

PERMFL  - 

DCLOSE  - 

DELFSE  - 

NSFPTR  - 

THRESH  - 


initialize 

program  - main  loop 

code  word  packing 

dictionary  open  command 

free  storage  allocation 

disk  word  at  a time  i/o 
initialize 

close  write  channel 
word  at  a time  disk  i/o 

ADPCM  word  encode 

insert  sample  in  energy  "window 
2 

compute  (C ( 1 ) -7 . 5) 

make  a permanent  speech  file 

close  dictionary 

delete  free  storage  entry 

get  next  file  with  same  file 
pointer 

program  threshold  modification 


Module 


H516 


Subroutine 


Module 

Subroutines 


H516 

GLIN 

CPYLUD 

CPYPBD 

ALCCOR 

TTYLOT 

CRLF 

PRERR 

PRCT 

BRKCH 

LOK516 

PRSLIN 

PAUSE 

AUTOWR 

NUMCHK 

SINGLE 

PRSNUM 

PRS3DG 

PRSWRD 

PBLKUP 

PBMTCH 

WRINT 

RDINT 

WRCLS 

WRWAT 

RDWAT 

AUTORD 


program  main  loop 

input  time  of  ASCII  text 

copy  in  dictionary  for  lookup 

copy  in  dictionary  for  speaking 

allocate  free  core 

console  output 

carriage  return  - line  feed 

print  error 

print  current  token 

check  for  breaks 


look  up  text  in  a single  line 

pause  handler  for  punctuation 

write  pointer  into  ring  buffer 

check  if  text  string  is  a numeral 

check  if  token  is  a single 
character 

parse  up  a numeral 

parse  up  3 digits  of  a numeral 

spell  a word 

phrase  look  ahead  binary  search 
match  single  dictionary  entry 

initialize  word  at  a time  i/o 

close  write  channel 
write  disk  word  at  a time 
read  disk  word  at  a time 
auto  read  of  words 


r 


■ 


! ‘ 

: £■ 


! ? 


Module 

Subroutines 


Module 

Subroutines 


PLY516 

PLAYBK 

SPINT 

PBFINT 

FILBUF 

ADSTRT 

PBINT 


main  loop 

speaking  initialize 

buffer  initialize 

keep  buffers  full 

start  A/D  converter  (clock) 

interrupt  handler  and  ADPCM 
decode 

also  variables  and  tables 


SPGLBL 

Speak  program  global  variables 


3.3  SUBROUTINE  DESCRIPTION 


Macro : . CAL 

Function:  . CALl  subroutine.  Provides  uniform  format 

for  calling  subroutines  which  have  error 
return. 

Other  Operations:  .CAL  sub,  error. 

Expands  into: 

JSR  R7 , sub 
error 

"sub"  is  the  name  of  the  subroutine  to  be 
called . 


"error"  is  the  address  of  the  error  handler 
concerned  with  an  error  return  from  the  sub- 
routine . 


Name:  PUSH 

Function:  PUSH  all  registers  on  stack,  also  position 

R5 . 

Called:  JSR  R5,  PUSH 

Arguments:  None. 

Other  Operations:  Upon  return  from  PUSH,  user  stack  looks 

like  this: 


i 


Name : 

Function : 

Called: 

Arguments : 

Other  Operations: 


Name : 

Function: 

Called: 

Arguments : 

Other  Operations: 


t* 


RTRN 

RETURN  fiom  subroutine;  restores  registers 
from  stack  and  makes  a return,  skipping 
error  return  location  in  calling  routine. 

JMP  RTRN 

R6  need  not  point  to  stack. 

R5 , however,  must  point  as  shown  in  "PUSH" 
description . 

Stack  before  call  after 


RTRN  ADD 


SR5 

SR4 

SR3 

SR2 

SRI 

SR0 


<HR5) 


Popped 

off 

Stack 


RTRN  ADD 


SR5 

SR4 

SR3 


<-(R6) 

(R5)  = SR5 
( R4 ) = SR4 
(R3)  = SR3 
(R2)  = SR2 
(Rl)  = SRI 
(R0)  = SR0 


R6  points  (PC)  = RTRN  ADD  + 2 

anywhere 

beyond  SR0 


ERTRN 

Error  ReTuRN.  Return  from  subroutine  after 
restoring  registers.  Return  made  by  in- 
directing through  error  address  pointed  to 
by  return  address  in  stack. 

JMP  ERTRN 

Exactly  as  "RTRN". 

Exactly  as  RTRN  except: 

(PC)  = (RTRN  ADD) 
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RING  BUFFER  PACKAGE 


The  ring  buffer  package  for  the  VRS  is  a versatile  system  of 
routines  for  the  buffering  of  data  between  two  programs. 

Features  include  those  of  a normal  ring  buffer  as  well  as  the 
ability  to  examine  the  contents  of  the  buffer  without  actually 
removing  the  item  from  the  buffer,  as  would  be  desirable  for 
comparisons  at  an  input  stream  with  many  text  strings.  An 
additional  feature  is  a limit  pointer  useful  for  line-at-a- 
time  editing.  The  limit  pointer  can  be  set  to  the  point  where 
an  input  line  ends.  While  this  line  is  being  processed,  a new 
line  is  composed.  The  buffer  can  only  be  processed  up  to  the 
limit  pointer,  even  though  the  input  pointer  is  beyond  that 
point.  This  prevents  processing  on  a partially  composed  line. 

Consulting  Figure  3-1,  the  function  of  each  of  the  pointers 
is  as  follows: 

IN  - this  is  the  input  pointer  to  the  buffer.  As  data  is 
added,  the  pointer  moves  toward  out  (clockwise) . In- 
valid data  can  be  removed  by  "backing  up"  (counter  clock- 
wise) . The  limit  on  the  clockwise  direction,  is  the  OUT 
pointer.  At  that  point  the  buffer  is  full.  The  back  up 
limit  is  the  LIN  pointer. 

LIN  - this  is  the  input  and  output  limit  pointer.  It  is 
usually  pointed  to  the  last  new  line  character  in  the 
input  stream.  Once  set,  the  input  pointer  cannot  be 
backed  up  beyond  it,  nor  can  the  output  pointer  move 
forward  past  it;  that  is,  the  output  pointer  cannot 
remove  data  beyond  the  limit  pointer.  LIN  can  only 
move  clockwise.  It  is  set  to  the  position  of  IN  by  a 
subroutine  call,  and  remains  stationary  until  the  next 
call  regardless  of  the  motion  of  IN  or  OUT. 

OUT  - the  ring  buffer  output  pointer.  Removal  of  data  is 
"final"  in  the  sense  that  this  pointer  cannot  be  backed 
up.  It  sets  the  limit  up  to  which  IN  can  insert  data. 

It  is  either  moved  one  character  at  a time,  or  moved  by 
a subroutine  call  to  the  position  of  COUT. 

COUT  - this  pointer  allows  examination  of  any  data  between 
the  LIN  and  the  OUT  pointers  in  the  area  shown  on 
Figure  3-1.  It  can  move  one  character  at  a time  in  either 
direction  or  be  set  to  the  value  at  OUT. 
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Ring  Buffer  Package. 


motion  limit  for  IN 


POINTER  SUMMARY  ARROWS  INDICATE  DIRECTION  OF  MOTION. 
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Name:  BUFINT 

Functions:  BUFfer  INiTialize.  Set  up  a ring  buffer 

in  core. 

Called:  .CAL  BUFINT,  error. 

Inputs:  R0  pointer  to  262io  byte  block  of  storage 

to  be  used  as  a ring  buffer. 

Outputs:  None. 

Other  Operations:  256^g  byte  circular  buffer  is  initialized 

in  core.  6 byte  header  is  provided  as 
follows : 

Initial 


» 

B^te 

Name 

Function 

Value 

• 

ii 

• 

0 

LIN 

Line  Limit.  Sets 
limit  beyond  which 
output  pointers 
cannot  go. 

0 

1 

IN 

Input  pointer. 

Points  to  next  free 
byte  in  buffer. 

0 

2 

CFUL 

Conditional  full- 
ness . Distance 
from  conditional 
output  pointer  to 
output  pointer. 

0 

• 

3 

COUT 

Conditional  output 
pointer.  Free- 
flowing  pointer  to 
examine  any  charac- 
ter between  LIN  and 
OUT  without  removing 
from  buffer. 

0 

4 

FUL 

Actual  fullness. 
Distance  from  output 
pointer  to  input. 

0 
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Byte  Name  Function 


Initial 

Value 


t .» 

V 

E 


Vi 


[ I 

I i j i 

r' 


i. 

* 


5 OUT  Output  pointer.  0 

Final  output  pointer 
for  characters. 

Routines  called:  None. 

Errors:  None. 


Name : 

RIN 

Function : 

Ring  buffer  INput.  Places  input 
in  ring  buffer. 

character 

Called : 

•CAL  RIN,  error. 

# Args: 

Two . 

Inputs : 

R0  - pointer  to  ring  buffer 

R1  - character  to  be  inserted  right  justi- 
fied. 

Outputs : 

None . 

Other  Operations: 

Fullness  count  and  input  pointer 
mented  after  character  inserted, 
conditional  fullness  updated.) 

incre- 

(Also 

Routines  called: 

None. 

Errors : 

Buffer  full. 

VI 


Name : 

Function : 

Called : 

Inputs : 

Outputs : 

Other  Operations: 
Routines  Called: 
Errors : 


LINSET 

LIN  pointer  SET.  Limit  pointer  set  to 
current  value  at  input  pointer. 

.CAL  LINSET,  error. 

R0  - pointer  to  ring  buffer. 

None. 

Contents  of  IN  pointer  placed  in  LIN  pointer. 
None. 

None . 


Name : 
Function : 


Called : 
Inputs : 

Outputs : 
Results : 


RINC 

Ring  buffer  Input  with  No  Check.  Equivalent 
to  RIN  followed  by  LINSET.  Used  when  data 
need  not  be  checked  before  definitely  entering 
it. 

.CAL  RINC,  error. 

R0  - pointer  to  ring  buffer. 

R1  - character  to  be  inserted . 

None . 

If  error  - none. 


If  no  error,  character  placed  in  buffer,  con- 
ditional fullness  and  fullness  updated,  limit 
pointer  (LIN)  and  input  pointer  both  set  to 
next  character. 

Routines  Called:  None. 


Errors : 


Buffer  already  full. 
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Name : RBKUP 

Function:  Ring  buffer  BacKUP.  Removes  last  character 

placed  in  ring  buffer. 

•CAL  RBKUP,  error. 

Two. 

R0  - pointer  to  ring  buffer  used. 

SRI  - if  no  error,  returns  character  re- 
moved - if  error,  returns  unchanged. 

Other  Operations:  Pointer  positions  moved. 

Routines  Called:  None. 

Errors:  Pointer  already  backed  up  to  LIN  limit 

pointer. 


Called : 

# Args : 
Inputs : 
Outputs : 


Name : CROUT 

Function:  Conditional  Ring  buffer  OUTput . Remove 

next  character  from  buffer  using  the  condi- 
tional ring  buffer  pointer. 

Called:  • .CAL  CROUT,  error. 

# Args:  Two. 

Inputs:  R0  - pointer  to  ring  buffer. 

Outputs  SRI  - character  removed  from  buffer.  If 

error,  no  change. 

Other  Operations:  Conditional  pointer  incremented,  condi- 
tional fullness  decremented  if  no  error, 
otherwise  no  change. 

Routines  Called:  None. 

Errors:  Buffer  "empty". 

Conditional  output  pointer  COUT  has  caught 
up  to  limit  pointer  LIN. 
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r *• 


Name : 


BUFRST 


H 

i 

\ Function: 

I 

Called: 

# Args: 

Other  Operations: 

i ' 

Errors : 

iii  * 

x 

k 

• 

Name : 

Function: 

L 

< 

Called: 

# Args : 

Other  Operations: 
Errors : 


BUFfer  ReSeT.  Resets  conditional  output 
ana  conditional  fullness  to  value  of  output 
and  fullness. 

.CAL  BUFRST,  error. 

None. 

Covered  .in  function. 

None. 


BUFSET 

BUFfer  SET.  Moves  output  and  fullness  up 
to  conditional  output  and  conditional 
fullness . 

•CAL  BUFSET,  error. 

None . 

Covered  in  function. 

None . 


{ 
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Name : 


CRBKUP 


Function: 

Called: 

# Args: 

Inputs : 

Outputs : 

Routines  Called: 
Errors : 

Other  Operations: 


Condition  Ring  Buffer  output  BacKUP . 

Back  up  to  previous  character. 

.CAL  RBKUP , error. 

Two . 

R0  - pointer  to  ring  buffer. 

SRI  - character  removed  from  buffer. 

If  error,  SRI  is  unchanged. 

None . 

Attempt  to  back  up  beyond  output  pointer . 

Conditional  fullness  incremented  if  no 
error.  COUT  backed  up  one  character. 


Name : 
Function: 

Called : 

# Args. 
Inputs : 


ROUT 

Ring  buffer  OUTput.  Removes  next  character 
pointed  to  by  output  pointer. 

•CAL  ROUT,  error. 

Two . 

R0  - pointer  to  ring  buffer. 


Outputs:  R1  - no  error:  Character  removed. 

R1  - error:  Unchanged. 

Other  Operations:  Error  - none. 

No  error  - output  incremented,  fullness 
decremented . 


Routines  Called:  None. 


Errors : 


Attempt  to  move  output  pointer  past  con- 
ditional output  pointer. 


Name : 


VMINT 


Function : 


Called : 

Arguments : 

Other  Operations: 


Routines  called: 


Errors : 


Virtual  Memory  INiTialize . Sets  up 
buffers  for  disk  to  permit  read  and 
of  disk  resident  dictionary  through 
memory  system. 


core 
write 
a virtual 


.CAL  VMINT,  error. 


None . 


Core  Buffers  initialized  to  contain  core 
keys  of  first  five  pages  (256  words  each) 
of  the  dictionary.  Core  buffers  brought 
in  by  an  LRU  algorithm,  so  appropriate 
variabl6s  for  LRU  are  initialized. 


None . 

RT-11  monitor  calls  . READW 


RT-11  errors  only  *-  disk  read. 

Error  message  pointer  returned  in  ERPNTR. 
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Name : 

Function : 
Called : 
Arguments : 


Other  Operations: 

Routines  called: 
Errors : 


RDBYT  - read  one  byte 
RDWRD  - read  one  word 
WRBYT  - write  one  byte 
WRWRD  - write  one  word 

Basic  input-output  for  dictionary. 

.CAL  name,  error. 

R2  - points  to  word  or  byte  on  disk,  if 
it  is  a word,  it  should  point  to  an  even 
byte  boundary. 

R1  - argument  to  be  r*ad  or  written. 

Byte  arguments  should  be  right  justified. 

On  return  from  read  byte,  top  byte  cleared. 

LRU  variables  updated.  If  desired,  data 
not  in  core  at  time  of  call  is  swapped  in. 


VMNG 

RT-11  errors. 

Error  message  pointer  returned  in  ERPNTR. 


Name : 


VMBKUP 


f ^ 


Function:  Virtual  Memory  BacK  UP.  Updates  disk 

resident  copy  of  dictionary  by  swapping 
out  pages  in  core  buffers. 

Called:  .CAL  VMBKUP,  error. 

Arguments:  None. 

Other  Operations:  None  other  than  described  in  function. 

Routines  Called:  RT-11  monitor  calls  .WRITW 

Errors:  RT-11  errors. 

Error  message  pointer  returned. 


- ■■ ■ - w m 

r 


Name : 

Function: 

Called : 

Arguments : 

Other  Operations: 

Routines  Called: 
Errors : 


DIRINT 

PI Rectory  INiTialize . 

.CAL  DIRINT,  error. 

None . 

All  variables  in  header  at  dictionary  set 
to  indicate  empty  dictionary. 

CRFSE. 

Error  return  from  CRFSE  returned  directly 
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y 

VI  Function: 

Called : 
Arguments : 


Other  Operations: 
Routines  called: 

Errors: 

L 


RDFSE  - Read  free  storage  entry. 

WRFSE  - Write  free  storage  entry, 
i/o  on  2-word  free  storage  information. 
•CAL  Name,  error. 

R2  - byte  pointer  to  first  byte  of  two- 
word  descriptor. 

Callers  stack  at  call  time: 

(R6)  - address  of  block  of  free  storage. 
2 (R6)  - size  of  free  storage  in  number 
of  blocks. 

Dictionary  written  via  WRWRD. 

RDWRD 

WRWRD 

Returned  directly  from  above  routines. 


Name : 


RDDE  - read  directory  entry. 
WRDE  - write  directory  entry. 


r 

^ * . 

t M 

{ * 

t v 

; ♦*  • 

v 

: i Function: 

l 

l * 1 

Called: 

Arguments : 

■ • 

\ 

Other  Operations: 
Routines  called: 

I 

Errors : 


Read  or  write  a three-word  file  information 
block  from  dictionary. 

.CAL  name,  error. 

R2  - pointer  to  first  byte  at  entry  to  be 
accessed.  (Offset  from  beginning  of  dic- 
tionary.) Callers  stack  as  follows  (at 
call  time) : 

(R6)  - pointer  to  text  name  of  file. 

2(R6)  - file  size  information. 

4 (R6 ) - pointer  to  first  block  of  file. 

On  WRDE,  dictionary  written  through  virtual 
memory . 

WRWRD 

RDWRD 

Returned  directly  from  above  calls. 
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Name: 

Function: 

Called: 

Arguments : 

Other  Operations: 


Routines  Called: 

Errors : 


CRFSE 

Create  a two-word  entry  in  the  table  of 
free  storage  space. 

.CAL  CRFSE,  error. 

Stack  as  follows: 

(R6)  - address  of  disk  area  to  be  returned 
to  free  storage. 

2 (R6)  - size  in  blocks  of  disk  area. 

Free  storage  area  sorted  by  address.  Entries 
moved  to  accommodate  new  entry.  If  new 
entry  is  contiguous  with  existing  entry, 
the  entries  are  merged  into  one  entry. 
Pointers  to  table  of  free  storage  space  are 
updated . 

RDFSE 

WRFSE 

F3PACK 

Free  storage  space  full.  Error  pointer 
returned  in  ERPNTR. 

Other  errors  directly  returned  from  called 
routines . 
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Name : 


DELFSE 


Function: 

Called : 

Arguments : 

Other  Operations: 

Routines  Called: 

Errors : 

Name : 

Function: 

Called : 

Arguments : 

Other  Operation: 
Errors : 


Delete  FREE  storage  table  entry. 

.CAL  DELFSE,  error. 

R2  pointer  to  entry  to  be  removed. 

Table  of  free  storage  updated  by  moving 
remaining  entries  down  over  deleted  entry 
and  by  updating  pointers  to  table. 

RDFSE 

WRFSE 

Returned  directly  from  routines  called. 


GUID 

Generate  Unique  Identifier  used  in  file 
creation. 

.CAL  GUID,  error. 

R3  - high  order  byte  of  unique  identifier 
returned  in  low  order  byte  of  R3.  High 
order  byte  of  R3  cleared. 

R4  - low  order  two  bytes  of  uid. 

Current  uid  in  file  header  updated. 

None. 
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Name:  CRDIR 

Function:  CReate  DIctionaRy  entry.  Takes  text  argu- 

ment and  creates  a new  dictionary  entry. 

File  block  size  is  initially  zero.  Synonyms 
use  three  byte  unique  identifier  copied 
into  last  block  size,  and  file  pointer 
fields  of  entry. 

Called:  .CAL  CRDIR,  error. 

Arguments:  R0  - pointer  ring  buffer  containing  text 

name  for  new  entry.  First  call  to  CROUT 
should  return  first  character  for  name. 

Name  used  until  first  non-blank  break 
character . 

R2  - insertion  point  for  new  entry. 

Other  Operations:  Text  entry  inserted  after  previous  end  of 

text  area.  (File  name  area.)  New  three-word 
entry  inserted  in  dictionary.  Pointers 
to  dictionary  updated. 

Routines  Called:  CRDCT 

RDDE 

WRDE 

GU1D 

Errors:  Dictionary  full  (returned  in  ERPNTR)  or 

else  error  returned  from  routines  called. 
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Name : 

Function: 

Called: 

Arguments: 

Other  Operations : 
Routines  Called: 


CRDCT 

Inserts  text  name  into  file  name  area  of 
dictionary.  Returns  pointer  to  newly 
created  name. 

.CAL  CRDCT,  error. 

RJJ  - pointer  to  ring  buffer  containing  text 
name  described  in  CRDIR. 

R1  - returns  pointer  to  location.  Text  name 
was  inserted. 

File  name  area  and  appropriate  pointer  to 
that  area  are  updated. 

CROUT 

BRKCH 

WRBYT 


Errors : 


Dictionary  full  - returned  in  ERPNTR. 

Or  else  error  returned  from  routines  called. 


Name : 
Function: 


Called : 

Arguments : 

Other  Operations: 
Routines  Called: 
Errors : 


NFSPTR 

Find  next  entry  with  same  file  pointer.  Used 
to  locate  synonyms  to  a file.  When  called, 
look  for  synonym  which  alphabetically  follows 
after  entry  provided  as  argument.  If  search 
runs  past  end  of  dictionary,  restart  at  the 
beginning  of  the  dictionary.  If  an  entry  has 
no  synonyms,  return  original  entry. 

•CAL  NSFPTR,  error. 

On  entry  R2  is  pointer  to  three-word  entry 
block  in  dictionary; 

on  exit  R2  contains  pointer  to  three-word 
block  of  next  synonym. 

None. 

RODE . 

Returned  directly  from  RODE . 


V 

-I 


3 ' 


l. 


Name : DELDE 

Function:  Delete  dictionary  entry  - remove  three- 

word  block  associated  with  entry  and  the 

text  entry  name. 

Called:  .CAL  DELDE,  error. 

Arguments:  R2  - pointer  to  three-word  block  of  dic- 

tionary entry  to  be  removed. 

Other  Operations:  Text  entry  removed  from  file  name  area. 

Any  names  in  higher  core  than  removed  name 
are  moved  down  to  compress  file  name  area. 
The  same  is  done  for  the  information  block 
area.  All  pointers  are  updated.  LCHECK 
is  also  updated  if  entry  it  points  to  is 
moved  by  delete. 

Routines  Called:  RDDE 

RDBYT 

WRBYT 

RDWRD 

WRWRD 

Errors:  Returned  directly  from  routines  called. 


Name : 


DCTBM 


Function: 

Called : 
Arguments : 


Other  Operations: 
Routines  Called: 


Errors : 


Find  best  match  in  dictionary.  Determines 
if  string  provided  provides  a match  with  an 
entry  in  the  dictionary  up  until  the  first 
non-blank  break  character. 

.CAL  DCTBM,  error. 

R0  - points  to  ring  buffer  with  string  in 
it.  If  match  output  pointer  points  to  be- 
ginning of  string  with  leading  blanks  flushed, 
conditional  output  pointer  points  to  break 
chart»cter  terminating  string.  If  no  match, 
both  output  pointers  point  to  beginning  of 
string  with  leading  blanks  flushed. 

R2  - pointer  to  match.  If  match  occurs, 

R2  is  pointed  to  entry  in  dictionary  which 
matches  the  string.  If  no  match,  R2  points 
to  place  in  dictionary  where  new  entry  would 
be  inserted  if  input  string  were  used  for 
test  name.  If  end  of  text  encountered  in 
input  string  before  any  other  text  encountered, 
R2  is  set  to  all  ones. 

None . 

FLBLK 

DLKUP 

CRBKUP 

CRDUT 

BRKCH 

BUFRST 

Text  string  does  not  match  any  entries  or 
end  of  text  in  input  string. 

ERPNTR  unaffected  by  DCTBM.  If  set  by 
routines  called,  its  value  is  returned 
unchanged. 


Name : 
Function: 

Called : 
Arguments : 


Other  Operations: 
Routines  Called: 

Errors : 


DLKUP 

Dictionary  LOOKUP.  Performs  a binary  search 
on  dictionary  to  find  entry  which  matches 
input  string. 

•CAL  DLKUP,  error. 

R0  - points  to  text  string  in  ring  buffer. 

Both  output  pointers  must  point  to  the 
first  character  of  the  string.  If  no  match 
is  found,  output  pointers  are  unchanged.  If 
match  occurs,  conditional  output  pointer 
points  to  break  character  at  end  of  match. 

R2  - if  match  occurs  in  course  of  search, 

R2  points  to  matching  entry.  If  no  match, 

R2  points  to  insertion  point  found  for  string. 
If  end  of  text,  R2  is  set  to  all  ones. 

R3  - if  match  found,  R3  points  to  insertion 
point  for  string.  This  is  done  because  a 
string  which  matches  an  entry  may  continue 
beyond  the  match.  For  example,  "NEW  YORK 
CITY"  would  match  "NEW  YORK"  in  the  diction- 
ary but  could  still  be  inserted  as  a new  entry. 

None . 

BUFRST 

DMTCH 

No  match  in  dictionary  or  end  of  text  in  ring 
buffer. 
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4 . DATA  BASE 


The  File  System  for  the  VRS  was  designed  to  meet  three 
criteria : 

a.  Speed  - The  File  System  must  be  capable  of  a data 
rate  of  7.5  disk  reads  per  second. 

b.  Compact  Dictionary  - The  File  System  eventually 
must  maintain  a full  dictionary  of  4000  entries.  To 
permit  file  lookup  to  proceed  at  maximum  speed  the 
dictionary  must  be  core  resident.  The  amount  of  in- 
formation associated  with  each  dictionary  entry  must 
be  minimized  in  order  to  keep  a 4000  entry  dictionary 
to  manageable  size. 

c.  Editing  Flexibility  - The  dictionary  must  be  easily 
modifiable  by  the  system  programs.  The  user  should  not 
be  subjected  to  constraints  due  to  limitations  in  the 
file  structure. 

In  addition  to  the  above  criteria,  the  dictionary  format 
must  permit  phrase  look-ahead. 

In  general,  these  criteria  conflict.  Speed  and  editing 
flexibility  always  require  additional  information,  which 
implies  additional  space.  The  design  chosen  provides 
maximum  speed  in  operation  but  permits  the  desired  editing 
capabilities  through  software  which  calculates  the  required 
pointers,  rather  than  storing  them  in  the  dictionary. 

File  System  Description 

The  VRS  file  system  is  divided  into  five  major  storage 
areas.  These  areas  (illustrated  in  Figure  1-2)  are: 

a.  Header  Block  - This  area  contains  a 34g  byte  de- 
scriptor for  the  file  system.  This  includes  an  8 byte 
name  block  and  information  concerning  the  size  of  the 
remaining  4 areas. 

b.  File  Name  Area  - The  text  names  for  the  various 
dictionary  entries  are  stored  in  this  area.  As  the  names 
can  be  of  any  length,  no  fixed  size  is  set  for  an  entry. 
Instead,  each  name  is  followed  by  a zero  byte. 

c.  File  Description  Area  - The  size  and  location  of 
each  dictionary  entry  is  stored  in  this  area. 
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a . Free  Storage  Description  Area  - Unused  disk  storage 
is  described  in  this  area. 

e.  Digitized  Voice  File  Area  - Contains  the  encoded 
speech. 

These  areas  are  all  on  disk  in  the  layout  shown  in  Figure  4-2. 
The  descriptor  areas  are  copied  into  core  memory  as  needed  by 
the  particular  program  using  the  file  system.  (For  example 
see  flowchart  for  "SPEAK"  in  Section  2.) 

To  visualize  the  functioning  of  these  various  areas  it  is  best 
to  examine  a typical  dictionary  entry  (Figure  1-3) . The  entry 
shown  is  an  example  of  how  the  encoded  utterance  "New  York 
City"  might  be  accessed  by  that  name  as  well  as  by  the  abbre- 
viation "NYC".  The  files  are  accessed  first  through  the  file 
descriptor  area.  This  area  consists  of  a number  of  three- 
word  blocks  which  appear  in  detail  in  the  example.  Most  of 
the  important  capabilities  are  provided  by  the  information  in 
these  blocks. 

The  first  word  points  to  the  file  name,  the  second  contains 
the  file  status  and  size  information,  and  the  third  points  to 
the  disk  location.  Alphabetic  sorting  is  accomplished  by 
moving  these  descriptor  blocks,  rather  than  directly  moving 
the  variable-length  entries  in  the  file  name  area. 

The  file  name  can  be  of  arbitrary  length,  contain  embedded 
blanks,  and  must  be  terminated  by  a zero  byte.  New  names  are 
added  to  the  end  of  the  name  area,  while  the  corresponding 
three-word  block  is  inserted  in  the  correct  position  in  the 
block  area  as  determined  by  the  sort. 

The  file  status  and  size  (second  word  of  descriptor  block) 
contains  the  following  information:  The  file  size  (in  256 
word  blocks)  is  contained  in  the  low  order  7 bits  of  the  high 

order  byte.  The  number  of  words  actually  used  in  the  last 

block  is  contained  in  the  low  order  byte.  The  high  order  bit 
of  the  word  is  used  for  abbreviations.  When  a dictionary 
entry  is  first  created,  the  high  order  bit,  called  a "synonym 

flag"  is  left  zero.  If  an  entry  is  being  created  as  an  ab- 

breviation for  an  already  existing  entry,  the  remainder  of 
the  second  and  third  words  are  copies  from  the  original  entry. 
During  editing,  an  entry's  synonyms  are  found  by  comparing 
the  file  pointer  and  size  words  for  a match  with  other  dic- 
tionary entries.  Note  that  an  abbreviation  entry  is  identi- 
cal in  every  respect  to  other  entries  except  for  the  flag. 

When  an  empty  file  is  created,  a three  byte  "unique  identi- 
fier" maintained  in  the  file  header  is  copied  into  the  file 
pointer  and  size  of  the  last  block  portions  of  the  dictionary 
entry.  This  provides  a means  of  distinguishing  synonyms  be- 
fore a file  is  created.  When  a file  is  deleted,  the  block 
size  and  the  file  pointer  are  copies  into  the  free  storage 
area  to  give  information  about  available  space  in  the  file 
storage  area. 


Header  Block 


FNA  (File  name  area) 


Free  area  for  FNA 


File  descriptor  area 


Free  space  for  FDA,  FSDA 


Free  storage  descriptor  area 


Voice  file  area 


FIGURE  4-2  - FILE  SYSTEM  PARTITION 
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